Advanced Tools for Video and Multimedia Mining

نویسندگان

  • Jia-Yu Pan
  • Christopher Olston
  • Shih-Fu Chang
چکیده

How do we automatically find patterns and mine data in large multimedia databases, to make these databases useful and accessible? We focus on two problems: (1) mining “uni-modal patterns” that summarize the characteristics of a data modality, and (2) mining “cross-modal correlations” among multiple modalities. Uni-modal patterns such as “news videos have static scenes and speech-like sounds”, and cross-modal correlations like “the blue region at the upper part of a natural scene image is likely to be the ‘sky”’, could provide insights on the multimedia content and have many applications. For uni-modal pattern discovery, we propose the method AutoSplit. AutoSplit provides a framework for mining meaningful “independent components” in multimedia data, and can find patterns in a wide variety of data modalities (e.g., video, audio, text, and time sequences). For example, in video clips, AutoSplit finds characteristic visual/auditory patterns, and can classify news and commercial clips with 81% accuracy. In time sequences like stock prices, AutoSplit finds hidden variables like “general growth trend” and “Internet bubble”, and can detect outliers (e.g., lackluster stocks). Based on AutoSplit, we design a system, ViVo, for mining biomedical images. ViVo automatically constructs a visual vocabulary which is biologically meaningful and can classify 9 biological conditions with 84% accuracy. Moreover, ViVo supports data mining tasks such as highlighting biologically interesting image regions, for biomedical research. For cross-modal correlation discovery, we propose MAGIC, a graph-based framework for multimedia correlation mining. When applied to news video databases, MAGIC can identify relevant video shots and transcript words for event summarization. On the task of automatic image captioning, MAGIC achieves a relative improvement of 58% in captioning accuracy as compared to recent machine learning techniques. Dedicated to my parents

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Review and Analysis of Multimedia Data Mining Tasks and Models

Over the past few decades, rapid changes in information technology have drastically changed the functions and activities of multimedia. Data mining has become more popular for extracting knowledge from multimedia data sets such as audio, video, speech, text, web, image and a combination of several types of these data sets. These are increasingly available and are semi-structured data or unstruc...

متن کامل

Tools, Techniques and Models for Multimedia Database Mining

Advances in multimedia acquisition and storage technology have led to tremendous growth in very large and detailed multimedia databases. Analyzing this huge amount of multimedia data to discover useful knowledge is a challenging problem. This challenge has opened the opportunity for research in Multimedia Data Mining (MDM). Multimedia data mining can be defined as the process of finding interes...

متن کامل

Extraction of object from the video

In this modern computer world, mining plays an important role to extract the required information from the information galaxy. The information galaxy is termed as Data Warehousing. Not only data, but also there are more fields which has huge amount of information in the galaxy, and cant be retrieved form it. For these cases, we need the mining techniques. Multimedia mining is one of the advance...

متن کامل

Discovering knowledge for better video indexing based on colors

In this paper, we present the discovery of rules for different challenges encountered in video indexing. These rules should be considered as knowledge that can be used as a guideline for the development of better indexing tools. We use a fuzzy decision tree to extract the rules based on color proportions of key-frames extracted from one single video-news. Experimental results and comparisons wi...

متن کامل

Adaptive Discovery of Indexing Rules for Video

This paper presents results, at an early stage of research work, of the use of fuzzy decision trees in a multimedia framework. We present the discovery of rules in three different indexing scenarios. These rules represent knowledge that can be interpreted as guidelines for the development of better indexing tools. We use a fuzzy decision tree algorithm to extract these rules (just) from color p...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006